Abstract. In this note, we study the $n \times n$ random Euclidean matrix whose entry $(i,j)$ is
equal to $f(\|X_i - X_j\|)$ for some function $f$ and the $X_i$'s are i.i.d. isotropic vectors in
$\mathbb{R}^p$. In the regime where $n$ and $p$ both grow to infinity and are proportional, we give
some sufficient conditions for the empirical distribution of the eigenvalues to converge weakly.
We illustrate our result on log-concave random vectors.

1. Introduction

Let $Y$ be an isotropic random vector in $\mathbb{R}^p$, i.e. $\mathbb{E}Y = 0$, $\mathbb{E}[YY^T] = I/p$, where $I$ is the
identity matrix. Let $(X_1, \ldots, X_n)$ be independent copies of $Y$. We define the $n \times n$ matrix $A$
by, for all $1 \le i,j \le n$,

\[ A_{ij} = f(\|X_i - X_j\|^2), \]

where $f : [0,\infty) \to \mathbb{R}$ is a measurable function and $\|\cdot\|$ denotes the Euclidean norm. The matrix
$A$ is a random Euclidean matrix. It has already attracted some attention, see e.g. Mézard, Parisi
and Zee [16], Vershik [18] or Bordenave [7] and references therein.

If $B$ is a symmetric matrix of size $n$, then its eigenvalues, say $\lambda_1(B), \ldots, \lambda_n(B)$, are real. The
empirical spectral distribution (ESD) of $B$ is classically defined as

\[ \mu_B = \frac{1}{n} \sum_{i=1}^{n} \delta_{\lambda_i(B)}, \]

where $\delta_x$ is the Dirac delta function at $x$. In this note, we are interested in the asymptotic
convergence of $\mu_A$ as $p$ and $n$ converge to $+\infty$. This regime has notably been previously considered
in El Karoui [10] and Do and Vu [9]. More precisely, we fix a sequence $p(n)$ such that

\[ \lim_{n \to \infty} \frac{p(n)}{n} = y \in (0,\infty). \tag{1} \]

Throughout this note, we consider, on a common probability space, an array of random variables
$(X_k(n))_{1 \le k \le n}$ such that $(X_1(n), \ldots, X_n(n))$ are independent copies of $Y(n)$, an isotropic vector
in $\mathbb{R}^{p(n)}$. For each $n$, we define the associated Euclidean matrix $A(n)$. For ease of notation, we
will often remove the explicit dependence in $n$: we write $p$, $Y$, $X_k$ or $A$ in place of $p(n)$, $Y(n)$,
$X_k(n)$ or $A(n)$.

The Marcenko-Pastur probability distribution with parameter $1/y$ is given by

\[ \nu_{MP}(dx) = (1 - y)^+ \delta_0(dx) + \frac{y}{2\pi x} \sqrt{(y_+ - x)(x - y_-)}\, \mathbf{1}_{[y_-, y_+]}(x)\, dx, \]

where $x^+ = (x \vee 0)$, $y_\pm = \left(1 \pm \frac{1}{\sqrt{y}}\right)^2$ and $dx$ denotes the Lebesgue measure. Since the celebrated
paper of Marcenko and Pastur [15], this distribution is known to be closely related to empirical
covariance matrices in high dimension.
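
As a sanity check on the normalisation of $\nu_{MP}$, the following minimal sketch integrates the density numerically. It assumes the standard form of the absolutely continuous part, $\frac{y}{2\pi x}\sqrt{(y_+ - x)(x - y_-)}$ on $[y_-, y_+]$, and the helper names (`mp_atom`, `mp_density`, `mp_integral`) are purely illustrative. Under that assumption the total mass should be close to 1 and the mean close to 1, the latter matching $\frac{1}{n}\mathrm{tr}(X^T X) \simeq 1$ for isotropic columns.

```python
import math

def mp_edges(y):
    """Support edges y_- and y_+ of the continuous part."""
    s = 1.0 / math.sqrt(y)
    return (1.0 - s) ** 2, (1.0 + s) ** 2

def mp_atom(y):
    """Mass of the atom at 0, namely (1 - y)^+."""
    return max(1.0 - y, 0.0)

def mp_density(x, y):
    """Assumed density of the continuous part of nu_MP at x."""
    lo, hi = mp_edges(y)
    if x <= lo or x >= hi:
        return 0.0
    return y / (2.0 * math.pi * x) * math.sqrt((hi - x) * (x - lo))

def mp_integral(g, y, steps=50_000):
    """Midpoint-rule approximation of the integral of g(x) * density(x)."""
    lo, hi = mp_edges(y)
    h = (hi - lo) / steps
    total = 0.0
    for k in range(steps):
        x = lo + (k + 0.5) * h
        total += g(x) * mp_density(x, y)
    return total * h

for y in (0.5, 2.0):
    mass = mp_atom(y) + mp_integral(lambda x: 1.0, y)
    mean = mp_integral(lambda x: x, y)
    print(f"y = {y}: total mass ~ {mass:.4f}, mean ~ {mean:.4f}")
```

The two values of $y$ are chosen so that one case has an atom at 0 ($y < 1$) and the other does not.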
We say that $Y$ has a log-concave distribution, if $Y$ has a density on $\mathbb{R}^p$ which is log-concave.
Log-concave random vectors have an increasing importance in convex geometry, probability and
statistics (see e.g. Barthe [5]). We will prove the following result.

Theorem 1. If $Y$ has a log-concave distribution and $f$ is three times differentiable at 2, then,
almost surely, as $n \to \infty$, $\mu_A$ converges weakly to $\mu$, the law of $f(0) - f(2) + 2f'(2) - 2f'(2)S$,
where $S$ has distribution $\nu_{MP}$.

With the weaker assumption that $f$ is differentiable at 2, Theorem 1 is conjectured in Do and
Vu [9]. Their conjecture has motivated this note. It would follow from the thin-shell hypothesis
which asserts that there exists $c > 0$, such that for any isotropic log-concave vector $Y$ in $\mathbb{R}^p$,
$\mathbb{E}(\|Y\| - 1)^2 \le c/p$ (see Anttila, Ball and Perissinaki [3] and Bobkov and Koldobsky [6]). Klartag
[14] has proved the thin-shell hypothesis for isotropic unconditional log-concave vectors.

The proof of Theorem 1 will rely on two recent results on log-concave vectors. Let $X = X(n)$
be the $p \times n$ matrix with columns given by $(X_1(n), \ldots, X_n(n))$. Pajor and Pastur have proved
the following:

Theorem 2 ([17]). If $Y$ has a log-concave distribution, then, in probability, as $n \to \infty$, $\mu_{X^T X}$
converges weakly to $\nu_{MP}$.

We will also rely on a theorem due to Guédon and Milman.

Theorem 3 ([12]). There exist positive constants $c_0, c_1$ such that if $Y$ is an isotropic log-concave
vector in $\mathbb{R}^p$, for any $t \ge 0$,

\[ \mathbb{P}\left( \big| \|Y\| - 1 \big| \ge t \right) \le c_1 \exp\left( -c_0 \sqrt{p}\, (t \wedge t^3) \right). \]

With Theorems 2 and 3 in hand, the heuristic behind Theorem 1 is simple. Theorem 3 implies
that $\|X_i\|^2 \simeq 1$ with high probability. Hence, since $\|X_i - X_j\|^2 = \|X_i\|^2 + \|X_j\|^2 - 2X_i^T X_j$, a
Taylor expansion of $f$ around 2 gives

\[ A_{ij} \simeq \begin{cases} f(2) - 2f'(2) X_i^T X_j & \text{if } i \ne j, \\ f(0) & \text{if } i = j. \end{cases} \]

In other words, the matrix $A$ is close to the matrix

\[ M = (f(0) - f(2) + 2f'(2)) I + f(2) J - 2f'(2) X^T X, \tag{2} \]

where $I$ is the identity matrix and $J$ is the matrix with all entries equal to 1. From Theorem 2,
$\mu_{X^T X}$ converges weakly to $\nu_{MP}$. Moreover, since $J$ has rank one, it is negligible for the weak
convergence of ESD. It follows that $\mu_M$ is close to $\mu$. The actual proof of Theorem 1 will be
elementary and it will follow this heuristic. We shall use some standard perturbation inequalities
for the eigenvalues. The idea to perform a Taylor expansion was already central in [10], [9].

Beyond Theorems 2 and 3, the proof of Theorem 1 is not related to log-concave vectors. In fact,
it is nearly always possible to linearize $f$ as soon as the norms of the vectors concentrate around
their mean. More precisely, let us say that two sequences of probability measures $(\mu_n)$, $(\nu_n)$
are asymptotically weakly equal, if for any bounded continuous function $f$, $\int f d\mu_n - \int f d\nu_n$
converges to 0.

Theorem 4. Assume that there exists an integer $\ell \ge 1$ such that $\mathbb{E}\big| \|Y\| - 1 \big|^{2\ell} = O(p^{-\ell})$, and
that for any $\varepsilon > 0$,

\[ \lim_{n \to \infty} \mathbb{P}\left( \max_{1 \le i,j \le n} \left\{ \big| \|X_i - X_j\|^2 - 2 \big| \vee \big| \|X_i\|^2 - 1 \big| \right\} > \varepsilon \right) = 0. \tag{3} \]

Then, if $f$ is $\ell$ times differentiable at 2, almost surely, $\mu_A$ is asymptotically weakly equal to the
law of $f(0) - f(2) + 2f'(2) - 2f'(2)S$, where $S$ has distribution $\mathbb{E}\mu_{X^T X}$.

The case $\ell = 1$ of the above statement is contained in Do and Vu [9, Theorem 5]. Besides
Theorem 2, some general conditions on the matrix $X$ guarantee the convergence of $\mu_{X^T X}$, see
Yin and Krishnaiah [19], Götze and Tikhomirov [11] or Adamczak [1].
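
The linearization heuristic can be explored numerically. The sketch below is only an illustration under stated assumptions: the columns are Gaussian, $X_i \sim N(0, I/p)$ (one particular isotropic log-concave law), the function is $f(x) = e^{-x/2}$ (an arbitrary smooth choice, not taken from the text), and the sizes $n = 150$, $p = 300$ are arbitrary. It builds $A$ and the matrix $M = (f(0) - f(2) + 2f'(2))I + f(2)J - 2f'(2)X^T X$, then compares their ESDs in Kolmogorov-Smirnov distance, computed as $\sup_t |F_A(t) - F_M(t)|$.

```python
import numpy as np

def ks_distance(ev_a, ev_b):
    """sup_t |F_a(t) - F_b(t)| for the empirical laws of two eigenvalue arrays."""
    grid = np.concatenate([ev_a, ev_b])
    cdf_a = np.searchsorted(np.sort(ev_a), grid, side="right") / ev_a.size
    cdf_b = np.searchsorted(np.sort(ev_b), grid, side="right") / ev_b.size
    return float(np.max(np.abs(cdf_a - cdf_b)))

def euclidean_and_linearized(n, p, f, df2, rng):
    """Return (A, M) for Gaussian columns X_i ~ N(0, I/p); df2 stands for f'(2)."""
    X = rng.standard_normal((p, n)) / np.sqrt(p)   # isotropic columns (assumed law)
    G = X.T @ X                                    # Gram matrix X^T X
    sq = np.diag(G)                                # squared norms ||X_i||^2
    D2 = sq[:, None] + sq[None, :] - 2.0 * G       # squared distances ||X_i - X_j||^2
    np.fill_diagonal(D2, 0.0)                      # remove round-off on the diagonal
    A = f(D2)
    M = ((f(0.0) - f(2.0) + 2.0 * df2) * np.eye(n)
         + f(2.0) * np.ones((n, n))
         - 2.0 * df2 * G)
    return A, M

rng = np.random.default_rng(0)
n, p = 150, 300
f = lambda x: np.exp(-x / 2.0)      # hypothetical test function
df2 = -0.5 * np.exp(-1.0)           # its derivative at 2
A, M = euclidean_and_linearized(n, p, f, df2, rng)
dist = ks_distance(np.linalg.eigvalsh(A), np.linalg.eigvalsh(M))
print(f"KS distance between the ESDs of A and M: {dist:.3f}")
```

The Kolmogorov-Smirnov distance is used here because it is insensitive to the low-rank corrections (such as the term in $J$) that separate $A$ from $M$.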



2. Proofs 



2.1. Perturbation inequalities. We first recall some basic perturbation inequalities of eigenvalues
and introduce a good notion of distances for ESD. For $\mu, \nu$ two real probability measures,
the Kolmogorov-Smirnov distance can be defined as

\[ d_{KS}(\mu, \nu) = \sup\left\{ \int f d\mu - \int f d\nu : \|f\|_{BV} \le 1 \right\}, \]

where, for $f : \mathbb{R} \to \mathbb{R}$, the bounded variation norm is $\|f\|_{BV} = \sup \sum_{k \in \mathbb{Z}} |f(x_{k+1}) - f(x_k)|$, and
the supremum is over all real increasing sequences $(x_k)_{k \in \mathbb{Z}}$. The following inequality is a classical
consequence of the interlacing of eigenvalues (see e.g. Bai and Silverstein [4, Theorem A.43]).

Lemma 5 (Rank inequality). If $B$, $C$ are $n \times n$ Hermitian matrices, then,

\[ d_{KS}(\mu_B, \mu_C) \le \frac{\mathrm{rank}(B - C)}{n}. \]
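
The rank inequality is easy to observe numerically. The following sketch, with an arbitrary random symmetric matrix $B$ and an arbitrary perturbation of rank $r = 3$ (both illustrative choices), computes the Kolmogorov-Smirnov distance as $\sup_t |F_B(t) - F_C(t)|$ and compares it with $r/n$.

```python
import numpy as np

def ks_distance(ev_b, ev_c):
    """sup_t |F_b(t) - F_c(t)| for the empirical laws of two eigenvalue arrays."""
    grid = np.concatenate([ev_b, ev_c])
    cdf_b = np.searchsorted(np.sort(ev_b), grid, side="right") / ev_b.size
    cdf_c = np.searchsorted(np.sort(ev_c), grid, side="right") / ev_c.size
    return float(np.max(np.abs(cdf_b - cdf_c)))

rng = np.random.default_rng(1)
n, r = 200, 3
B = rng.standard_normal((n, n))
B = (B + B.T) / 2.0                    # random real symmetric matrix
U = rng.standard_normal((n, r))
C = B + 5.0 * (U @ U.T)                # large perturbation, but of rank r only

lhs = ks_distance(np.linalg.eigvalsh(B), np.linalg.eigvalsh(C))
print(f"d_KS = {lhs:.4f}, rank(B - C)/n = {r / n:.4f}")
```

The perturbation is deliberately large in norm: the bound depends only on its rank.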



For $p \ge 1$, let $\mu, \nu$ be two real probability measures such that $\int |x|^p d\mu$ and $\int |x|^p d\nu$ are finite.
We define the $L^p$-Wasserstein distance as

\[ W_p(\mu, \nu) = \left( \inf_{\pi} \int |x - y|^p\, d\pi \right)^{1/p}, \]

where the infimum is over all couplings $\pi$ of $\mu$ and $\nu$ (i.e. $\pi$ is a probability measure on $\mathbb{R} \times \mathbb{R}$
whose first marginal is equal to $\mu$ and second marginal is equal to $\nu$). Hölder's inequality implies
that for $1 \le p \le q$, $W_p \le W_q$. Moreover, the Kantorovich-Rubinstein duality gives a variational
expression for $W_1$:

\[ W_1(\mu, \nu) = \sup\left\{ \int f d\mu - \int f d\nu : \|f\|_{L} \le 1 \right\}, \]

where $\|f\|_L = \sup_{x \ne y} |f(x) - f(y)| / |x - y|$ is the Lipschitz constant of $f$. The next classical
inequality is particularly useful (see e.g. Anderson, Guionnet and Zeitouni [2, Lemma 2.1.19]).

Lemma 6 (Hoffman-Wielandt inequality). If $B$, $C$ are $n \times n$ Hermitian matrices, then

\[ W_2(\mu_B, \mu_C) \le \sqrt{\frac{1}{n} \mathrm{tr}(B - C)^2}. \]
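
A short numerical illustration of this inequality, under illustrative choices of $B$ and $C$: for two empirical measures on the real line with $n$ atoms each, the optimal coupling for $W_2$ pairs the sorted atoms, so $W_2(\mu_B, \mu_C)$ is the root mean square difference of the sorted eigenvalues. The helper name `w2_esd` is hypothetical.

```python
import numpy as np

def w2_esd(B, C):
    """W_2 distance between the ESDs of two symmetric matrices of equal size."""
    ev_b = np.linalg.eigvalsh(B)       # returned in ascending order
    ev_c = np.linalg.eigvalsh(C)
    return float(np.sqrt(np.mean((ev_b - ev_c) ** 2)))

rng = np.random.default_rng(2)
n = 200
B = rng.standard_normal((n, n))
B = (B + B.T) / 2.0
P = rng.standard_normal((n, n))
C = B + 0.3 * (P + P.T) / 2.0          # full-rank symmetric perturbation

lhs = w2_esd(B, C)
rhs = float(np.sqrt(np.trace((B - C) @ (B - C)) / n))
print(f"W_2 = {lhs:.4f}, sqrt(tr(B - C)^2 / n) = {rhs:.4f}")
```

In contrast with the rank inequality, this bound is useful for perturbations of full rank but small Frobenius norm.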



We finally introduce the distance

\[ d(\mu, \nu) = \sup\left\{ \int f d\mu - \int f d\nu : \|f\|_L \le 1 \text{ and } \|f\|_{BV} \le 1 \right\}. \]

By Lemmas 5 and 6, we obtain that for any $n \times n$ Hermitian matrices $B$, $C$,

\[ d(\mu_B, \mu_C) \le \sqrt{\frac{1}{n} \mathrm{tr}(B - C)^2} \wedge \frac{\mathrm{rank}(B - C)}{n}. \tag{4} \]

Notice that $d(\mu_n, \mu) \to 0$ implies that $\mu_n$ converges weakly to $\mu$.
2.2. Concentration inequality. For $x = (x_1, \ldots, x_n) \in M_{p,n}(\mathbb{R})$, define $a(x)$ as the Euclidean
matrix obtained from the columns of $x$: $a(x)_{ij} = f(\|x_i - x_j\|^2)$. In particular, we have $A = a(X)$.
Let $i \in \{1, \ldots, n\}$, $x' = (x'_1, \ldots, x'_n) \in M_{p,n}(\mathbb{R})$ and assume that $x'_j = x_j$ for all $j \ne i$. Then
$a(x)$ and $a(x')$ have all entries equal but the entries on the $i$-th row or column. We get

\[ \mathrm{rank}(a(x) - a(x')) \le 2. \]

It thus follows from Lemma 5 that for any function $f$ with $\|f\|_{BV} < \infty$,

\[ \left| \int f d\mu_{a(x)} - \int f d\mu_{a(x')} \right| \le \frac{2 \|f\|_{BV}}{n}. \]

Using Azuma-Hoeffding's inequality, it is then straightforward to check that for any $t \ge 0$,

\[ \mathbb{P}\left( \int f d\mu_A - \mathbb{E} \int f d\mu_A \ge t \right) \le \exp\left( - \frac{n t^2}{8 \|f\|_{BV}^2} \right). \tag{5} \]

(For a proof, see [8, proof of Lemma C.2] or Guntuboyina and Leeb [13].) Using the Borel-Cantelli
Lemma, this shows that for any such function $f$, a.s.

\[ \int f d\mu_A - \int f d\mathbb{E}\mu_A \to 0. \]
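
The bounded-difference property behind (5) can be checked directly. The sketch below is a minimal illustration with assumed ingredients: Gaussian columns, the hypothetical choice $f(t) = e^{-t}$, and test functions of the form $\mathbf{1}_{(-\infty, t]}$ (of bounded variation norm 1), so that the quantity monitored is $\sup_t |F_{a(x)}(t) - F_{a(x')}(t)|$. Resampling a single column should move it by at most $2/n$.

```python
import numpy as np

def euclidean_matrix(x, f):
    """a(x)_ij = f(||x_i - x_j||^2) for the columns x_i of the p x n array x."""
    g = x.T @ x
    sq = np.diag(g)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * g, 0.0)
    np.fill_diagonal(d2, 0.0)
    return f(d2)

def ks_distance(ev_a, ev_b):
    """sup_t |F_a(t) - F_b(t)| for the empirical laws of two eigenvalue arrays."""
    grid = np.concatenate([ev_a, ev_b])
    cdf_a = np.searchsorted(np.sort(ev_a), grid, side="right") / ev_a.size
    cdf_b = np.searchsorted(np.sort(ev_b), grid, side="right") / ev_b.size
    return float(np.max(np.abs(cdf_a - cdf_b)))

rng = np.random.default_rng(3)
p, n, i = 40, 120, 7
f = lambda t: np.exp(-t)                               # hypothetical choice of f
x = rng.standard_normal((p, n)) / np.sqrt(p)
x_prime = x.copy()
x_prime[:, i] = rng.standard_normal(p) / np.sqrt(p)    # resample column i only

gap = ks_distance(np.linalg.eigvalsh(euclidean_matrix(x, f)),
                  np.linalg.eigvalsh(euclidean_matrix(x_prime, f)))
print(f"sup_t |F_a(x)(t) - F_a(x')(t)| = {gap:.4f}, bound 2/n = {2 / n:.4f}")
```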

Now, recall that $M$ was defined by (2). Since the matrix $J$ has rank one, from Theorem 2
and Lemma 5, $\mathbb{E}\mu_M$ converges weakly to $\mu$. Hence our Theorem 1 is a corollary of the following
proposition.

Proposition 7. Under the assumptions of Theorem 1, we have

\[ \lim_{n \to \infty} d(\mathbb{E}\mu_A, \mathbb{E}\mu_M) = 0. \]

2.3. Proof of Proposition 7. The idea is to perform a multiple Taylor expansion which takes
the best out of (4).

Step 1 : concentration of norms. By assumption, there exists an open interval $K = (2 - \delta, 2 + \delta)$
such that $f$ is $C^1$ in $K$ and, for any $x \in K$,

\[ f(x) = f(2) + f'(2)(x - 2) + \frac{f''(2)}{2}(x - 2)^2 + \frac{f'''(2)}{6}(x - 2)^3 (1 + o(1)). \]

For any $i \ne j$, $(X_i - X_j)/\sqrt{2}$ is an isotropic log-concave vector. Define the sequence $\varepsilon(n) =
n^{-\kappa} \wedge (\delta/2)$ with $0 < \kappa < 1/6$. It follows from Theorem 3 and the union bound that the event

\[ \mathcal{E} = \left\{ \max_{i,j} \left\{ \big| \|X_i - X_j\|^2 - 2 \big| \vee \big| \|X_i\|^2 - 1 \big| \right\} \le \varepsilon(n) \right\} \]

has probability tending to 1 as $n$ goes to infinity.



Step 2 : Taylor expansion around $\|X_i\|^2 + \|X_j\|^2$. We consider the matrix

\[ B_{ij} = \begin{cases} f(\|X_i\|^2 + \|X_j\|^2) - 2 f'(\|X_i\|^2 + \|X_j\|^2) X_i^T X_j & \text{if } i \ne j, \\ f(0) & \text{if } i = j. \end{cases} \]

On the event $\mathcal{E}$, $\|X_i\|^2 + \|X_j\|^2 \in K$. Since $f$ is $C^1$ in $K$, we may perform a Taylor expansion
of $f(\|X_i - X_j\|^2)$ around $\|X_i\|^2 + \|X_j\|^2$. It follows that for $i \ne j$,

\[ |A_{ij} - B_{ij}| = o\left( \big| \|X_i - X_j\|^2 - \|X_i\|^2 - \|X_j\|^2 \big| \right) \le \delta(n) |X_i^T X_j|, \]

where $\delta(n)$ is a sequence going to 0. From (4) and Jensen's inequality, we get

\[ d(\mathbb{E}\mu_A, \mathbb{E}\mu_B) \le \mathbb{E} d(\mu_A, \mu_B) \le \mathbb{P}(\mathcal{E}^c) + \Big( \frac{1}{n} \sum_{i,j} \mathbb{E}|A_{ij} - B_{ij}|^2 \mathbf{1}_{\mathcal{E}} \Big)^{1/2} \le \mathbb{P}(\mathcal{E}^c) + \delta(n) \big( n\, \mathbb{E}|X_1^T X_2|^2 \big)^{1/2}. \]



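Now, from the assumption that $X_1$ and $X_2$ are independent and isotropic, we find

\[ \mathbb{E}|X_1^T X_2|^2 = \mathbb{E}\Big( \sum_{k=1}^{p} X_{k1} X_{k2} \Big)^2 = \sum_{k=1}^{p} \big( \mathbb{E} X_{k1}^2 \big)^2 = \frac{1}{p}. \]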



By assumption (1), we deduce that

\[ \lim_{n \to \infty} d(\mathbb{E}\mu_A, \mathbb{E}\mu_B) = 0. \]

It thus remains to compare $\mathbb{E}\mu_B$ and $\mathbb{E}\mu_M$.
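
The identity $\mathbb{E}|X_1^T X_2|^2 = 1/p$ used in this step can be checked by simulation. The sketch below assumes Gaussian vectors $X_1, X_2 \sim N(0, I/p)$, which is only one example of an isotropic law, and the function name `mean_sq_inner_product` is illustrative.

```python
import random

def mean_sq_inner_product(p, trials, seed=0):
    """Monte Carlo estimate of E|X_1^T X_2|^2 for independent X_1, X_2 ~ N(0, I/p)."""
    rng = random.Random(seed)
    scale = 1.0 / p ** 0.5                 # each coordinate has variance 1/p
    total = 0.0
    for _ in range(trials):
        dot = sum((scale * rng.gauss(0.0, 1.0)) * (scale * rng.gauss(0.0, 1.0))
                  for _ in range(p))
        total += dot * dot
    return total / trials

p = 20
estimate = mean_sq_inner_product(p, trials=20_000)
print(f"Monte Carlo estimate: {estimate:.5f}, 1/p = {1 / p:.5f}")
```

With $n$ and $p$ proportional, this is what keeps $n\, \mathbb{E}|X_1^T X_2|^2$ bounded in the estimate of step 2.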

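Step 3 : Taylor expansion around 2. We define the matrix

\[ C_{ij} = \begin{cases} f(\|X_i\|^2 + \|X_j\|^2) - 2 f'(2) X_i^T X_j & \text{if } i \ne j, \\ f(0) & \text{if } i = j. \end{cases} \]

We now use the fact that $f'$ is locally Lipschitz at 2. It follows that if $\mathcal{E}$ holds, for $i \ne j$,

\[ |B_{ij} - C_{ij}| = O\left( |X_i^T X_j| \, \big| \|X_i\|^2 + \|X_j\|^2 - 2 \big| \right) \le c\, \varepsilon(n) |X_i^T X_j|. \]

The argument of step 2 implies that

\[ \lim_{n \to \infty} d(\mathbb{E}\mu_B, \mathbb{E}\mu_C) = 0. \]

It thus remains to compare $\mathbb{E}\mu_C$ and $\mathbb{E}\mu_M$.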

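Step 4 : Taylor expansion around 2 again. We now consider the matrix

\[ D_{ij} = \begin{cases} f(2) + f'(2)(\|X_i\|^2 + \|X_j\|^2 - 2) + \frac{f''(2)}{2}(\|X_i\|^2 + \|X_j\|^2 - 2)^2 \\ \quad + \frac{f'''(2)}{6}(\|X_i\|^2 + \|X_j\|^2 - 2)^3 - 2 f'(2) X_i^T X_j & \text{if } i \ne j, \\ f(0) & \text{if } i = j. \end{cases} \]

We are going to prove that

\[ \lim_{n \to \infty} d(\mathbb{E}\mu_C, \mathbb{E}\mu_D) = 0. \tag{6} \]

We perform a Taylor expansion of order 3 of $f(\|X_i\|^2 + \|X_j\|^2)$ around 2. It follows that if $\mathcal{E}$
holds, for $i \ne j$,

\[ |C_{ij} - D_{ij}| = o\left( \big| \|X_i\|^2 + \|X_j\|^2 - 2 \big|^3 \right) \le \delta(n) \big| \|X_i\|^2 + \|X_j\|^2 - 2 \big|^3, \]

where $\delta(n)$ is a sequence going to 0. Using (4) and arguing as in step 2, in order to prove (6),
it thus suffices to show that

\[ \frac{1}{n} \sum_{i \ne j} \mathbb{E} \big| \|X_i\|^2 + \|X_j\|^2 - 2 \big|^6 \mathbf{1}_{\mathcal{E}} = O(1). \]

Since, for $\ell \ge 1$, $|x + y|^\ell \le 2^{\ell - 1}(|x|^\ell + |y|^\ell)$, it is sufficient to show that

\[ n\, \mathbb{E}\big( \|X_1\|^2 - 1 \big)^6 \mathbf{1}_{\mathcal{E}} = O(1). \]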
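To this end, for integer $\ell \ge 1$, we write

\[ \mathbb{E}\big| \|X_1\|^2 - 1 \big|^\ell \mathbf{1}_{\mathcal{E}} = \mathbb{E}\big| \|X_1\| - 1 \big|^\ell \big| \|X_1\| + 1 \big|^\ell \mathbf{1}_{\mathcal{E}} \le 3^\ell\, \mathbb{E}\big| \|X_1\| - 1 \big|^\ell. \]

Then, Theorem 3 implies that there exists $c_\ell$ such that

\[ \mathbb{E}\big| \|X_1\| - 1 \big|^\ell \le c_\ell\, p^{-\ell/6}. \]

It follows that

\[ \mathbb{E}\big| \|X_1\|^2 - 1 \big|^\ell \mathbf{1}_{\mathcal{E}} = O\big( p^{-\ell/6} \big). \tag{7} \]

This proves (6). It finally remains to compare $\mathbb{E}\mu_D$ and $\mathbb{E}\mu_M$.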

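Step 5 : End of proof. We set

\[ Z_i = \|X_i\|^2 - 1. \]

We note that for $i \ne j$,

\[ D_{ij} = M_{ij} + \sum_{1 \le k + \ell \le 3} c_{k\ell} Z_i^k Z_j^\ell, \]

for some coefficients $c_{k\ell}$ depending on $f'(2)$, $f''(2)$, $f'''(2)$. Note that $c_{10} = c_{01} = f'(2)$. Similarly,

\[ D_{ii} = M_{ii} + 2 f'(2) Z_i = M_{ii} + c_{10} Z_i + c_{01} Z_i. \]

Define the matrix $E$, for all $1 \le i,j \le n$,

\[ E_{ij} = M_{ij} + \sum_{1 \le k + \ell \le 3} c_{k\ell} Z_i^k Z_j^\ell. \]

If $\mathcal{E}$ holds, then $\max_i |Z_i| \le \varepsilon(n)$ and we find

\[ |E_{ij} - D_{ij}| = \mathbf{1}(i = j) \Big| \sum_{2 \le k + \ell \le 3} c_{k\ell} Z_i^{k + \ell} \Big| \le c\, \mathbf{1}(i = j)\, \varepsilon(n)^2. \]

It follows from (4) that

\[ d(\mathbb{E}\mu_D, \mathbb{E}\mu_E) \le \mathbb{P}(\mathcal{E}^c) + \Big( \frac{1}{n} \sum_{i=1}^{n} \mathbb{E}|E_{ii} - D_{ii}|^2 \mathbf{1}_{\mathcal{E}} \Big)^{1/2} \le \mathbb{P}(\mathcal{E}^c) + c\, \varepsilon(n)^2. \]

We deduce that

\[ \lim_{n \to \infty} d(\mathbb{E}\mu_D, \mathbb{E}\mu_E) = 0. \]

We notice finally that the matrix $E - M$ is equal to

\[ \sum_{1 \le k + \ell \le 3} c_{k\ell}\, Z^k (Z^\ell)^T, \]

where $Z^k$ is the vector with coordinates $(Z_i^k)_{1 \le i \le n}$. It implies in particular that $\mathrm{rank}(E - M) \le 9$,
indeed the rank is subadditive and $\mathrm{rank}(Z^k (Z^\ell)^T) \le 1$. In particular, it follows from (4) that

\[ d(\mathbb{E}\mu_E, \mathbb{E}\mu_M) \le \mathbb{E} d(\mu_E, \mu_M) \le \frac{9}{n}. \]

This concludes the proof of Proposition 7 and of Theorem 1.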
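
The rank bound used at the end of step 5 can be illustrated numerically. In the sketch below the vector $Z$ and the coefficients are random stand-ins (the actual $c_{k\ell}$ depend on the derivatives of $f$ at 2 and are not computed here); the point is only to count the index pairs with $1 \le k + \ell \le 3$ and to observe the rank of the resulting sum of outer products.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 60
Z = rng.uniform(-0.5, 0.5, size=n)              # stand-in for (||X_i||^2 - 1)_i

# Index pairs (k, l) with k, l >= 0 and 1 <= k + l <= 3.
pairs = [(k, l) for k in range(4) for l in range(4) if 1 <= k + l <= 3]
coef = {kl: rng.standard_normal() for kl in pairs}   # hypothetical c_{kl}

# Sum of the rank-one matrices c_{kl} Z^k (Z^l)^T, with entrywise powers of Z.
R = sum(c * np.outer(Z ** k, Z ** l) for (k, l), c in coef.items())
rank = int(np.linalg.matrix_rank(R))
print(f"number of terms: {len(pairs)}, numerical rank: {rank}")
```

Every column of this sum lies in the span of the four vectors $Z^0, Z^1, Z^2, Z^3$, so the rank is in fact at most 4; the bound 9 from subadditivity is all the argument needs.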
2.4. Proof of Theorem 4. The concentration inequality (5) holds. It is thus sufficient to
prove the analog of Proposition 7. If $\ell \ge 2$, the proof is essentially unchanged. In step 1, the
assumption (3) implies the existence of a sequence $\varepsilon = \varepsilon(n)$ going to 0 such that $\mathbb{P}(\mathcal{E}) \to 1$.
Then, in step 4, it suffices to extend the Taylor expansion up to order $\ell$.

For the case $\ell = 1$: in step 2, we perform directly the Taylor expansion around 2, for $i \ne j$
we write $f(\|X_i - X_j\|^2) = f(2) - 2 f'(2) X_i^T X_j (1 + o(1))$. We then move directly to step 5. (As
already pointed out, this case is treated in [9].)